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CONTROLLED MOVING WINDOW ADAPTIVE HISTOGRAM 

EQUALIZATION 

CROSS REFERENCES TO RELATED APPLICATIONS 

[0001] This application is related to United States Patent Application Serial No. 
<xxxx> entitled "IMAGE CLASSIFICATION USING CONCENTRATION RATIO," 
filed January fly 2004, and assigned to the assignee of the present application. 

STATEMENT REGARDING FEDERALLY SPONSORED 
RESEARCH OR DEVELOPMENT 

[0002] None. 

REFERENCE TO SEQUENTIAL LISTING, ETC. 

[0003] None. 

BACKGROUND 

1. Field of the Invention 

[0004] The present invention relates to image processing, and more particularly to 
methods and systems for reducing artifacts that are caused by image processing. 

2. Description of the Related Art 

[0005] There are a variety of existing image-processing systems and techniques. 
Some of these existing systems and techniques use histogram equalization to expand the 
dynamic range and contrast of an image. In a technique known as adaptive histogram 
equalization ("AHE"), the image is divided into small portions called 'tiles," and 
histogram equalization is applied to each tile by using a look-up table. AHE can 
improve the contrast of images that are partially overexposed and/or partially 
underexposed. 
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[0006] In addition to image processing to improve image quality, image 
classification may also be of interest in various circumstances. For example, in printing 
or copying, various settings (e.g. halftone mode, color table, etc.) can be selected 
automatically if the image content or class of an image (e.g., text, graphic, photograph, 
etc.) is known. 

[0007] Although AHE is relatively effective in fixing exposure problems in an 
image, it also introduces unwanted, tiling artifacts. In addition, if AHE is mistakenly 
applied to a properly exposed image, the process boosts contrast to an undesirable level 
and the resulting image is worse than the original. Accordingly, there is a need for 
improved methods and systems for processing images. 

SUMMARY OF THE INVENTION 

[0008] Some embodiments of the invention provide mechanisms for applying 
moving-window, AHE in a controlled manner. In some embodiments, the control 
technique has two main aspects. First, it allows a user to select a degree of enhancement 
(or a control parameter) to his satisfaction. Thus, it does not always enforce the 
maximum contrast. Second, the control parameter is also adjusted based on an image's 
local characteristics to minimize graininess. The control parameter may be based on a 
concentration ratio of an image. 

[0009] In one embodiment, the present invention provides a method of processing 
an image. The method may include breaking the image into a plurality of tiles; and 
processing each of the plurality of tiles; determining a histogram for one of the plurality 
of tiles; determining a concentration ratio for the one of the plurality of tiles; 
determining an area of the image that includes the one of the plurality of tiles and 
information outside the one of the plurality of tiles; creating a first output by performing 
a histogram equalization on the area; creating a second output based on a control 
parameter and the first output; and using the second output to process the one of the 
plurality of tiles 

[0010] In another embodiment, the invention may include a system for processing 
an image. The system may include a processor configured to break an image into a 
plurality of tiles and process each of the plurality of tiles; determine a histogram for one 
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of the plurality of tiles; determine a concentration ratio for the one of the plurality of 
tiles; determine an area of the image that includes the one of the plurality of tiles and 
information outside the one of the plurality of tiles; create a first set of values by 
performing a histogram equalization on the area; create a second set of values based on 
5 a control parameter and the first output; and use the second set of values to process the 
one of the plurality of tiles. 

[0011] In yet another embodiment, the invention may include an image processing 
system that includes an image capture device operable to output an image and a 
controlled, equalization processor coupled to the image capture device. The processor 

10 might be configured to break the image into a plurality of tiles and process each of the 
plurality of tiles by determining a concentration ratio for the one of the plurality of tiles, 
obtaining a control parameter, determining an area of the image that includes the one of 
the plurality of tiles and information outside the one of the one or more tiles, creating a 
first output by generating a histogram of the area, creating a second output based on the 

1 5 control parameter and the first output, and using the second output to process the one of 
the plurality of tiles. 

[0012] In still another embodiment, the invention might include a method of 
processing an image. The method may include capturing an image of an object, and 
applying controlled, equalization to an image generated by the image capture device, 
20 where the controlled, histogram equalization uses a concentration ratio. 

[0013] Further still, another embodiment of the present invention could include a 
computer-readable medium containing instructions for processing an image. The 
instructions for processing the image might include those that break the image into a 
plurality of tiles, and process each of the plurality of tiles. Each of the plurality of tiles 

25 might be processed by obtaining a control parameter, determining a histogram for one of 
the plurality of tiles, determining a concentration ratio for the one of the plurality of 
tiles, determining an area of the image that includes the one of the plurality of tiles and 
information outside the one of the plurality of tiles, creating a first output by performing 
a histogram equalization on the area, creating a second output based on the control 

30 parameter and the first output, and using the second output to process the one of the 
plurality of tiles. 
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[0014] Other features and advantages of the invention will become apparent to those 
skilled in the art upon review of the following detailed description, claims, and 
drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

5 [0015] The patent or application file contains at least one drawing executed in color. 
Copies of the patent or patent application publication with color drawings(s) will be 
provided by the Office upon request and payment of the necessary fee. 

[0016] In the drawings: 

[0017] Fig. 1 is an illustration of a controlled, moving- window processor. 

10 [0018] Fig. 2 is an illustration of a computer system that may be used in 
embodiments of the invention. 

[0019] Fig. 3 is a flow chart illustrating image processing. 

[0020] Fig. 4 is another flow chart illustrating details for a portion of the process 
illustrated in Fig. 3. 

15 [0021] Fig. 5 is an illustration of shifting tiles and windows in an image. 

[0022] Figs. 6A-6C illustrate an unprocessed color image, a color image processed 
using equalization, and a color image processed using controlled, moving-window 
equalization processing, respectively. 



[0023] 


Fig. 


7 illustrates an exemplary gray scale photograph. 


[0024] 


Fig. 


8 is a histogram of the photograph in Fig. 7. 


[0025] 


Fig. 


9 is an illustration of a text-only image. 


[0026] 


Fig. 


10 is a histogram of the image in Fig. 9. 


[0027] 


Fig. 


1 1 is an illustration of a classification processor. 
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[0028] Fig. 12 is a flowchart illustrating a process for determining an image 
classification. 

[0029] Fig. 1 3 is a flow chart illustrating additional details of a portion of the 
process shown in Fig. 12. 

5 [0030] Fig. 14 is a schematic of a system that combines features of image 
classification and controlled equalization. 

[0031] It is to be understood that the invention is not limited in its application to the 
details of construction and the arrangement of components set forth in the following 
description or illustrated in the drawings. The invention is capable of other 

1 0 embodiments and of being practiced or of being carried out in various ways. Also, it is 
to be understood that the phraseology and terminology used herein is for the purpose of 
description and should not be regarded as limiting. The use of "including," 
"comprising," or "having" and variations thereof herein is meant to encompass the items 
listed thereafter and equivalents thereof as well as additional items. Unless limited 

1 5 otherwise, the terms "connected," "coupled," and "mounted," and variations thereof 
herein are used broadly and encompass direct and indirect connections, couplings, and 
mountings. In addition, the terms "connected" and "coupled" and variations thereof are 
not restricted to physical or mechanical connections or couplings. 

DETAILED DESCRIPTION 

20 [0032] Before embodiments of the invention are described in detail a brief review of 
histogram equalization and moving window AHE is provided. 

[0033] In ordinary histogram equalization, an attempt is made to make the 
histogram uniform by redistributing gray or color levels, depending on whether black 
and white or color images are processed. The following discussion is based on gray 
25 scale images with 256 levels (enumerated 0-255). If color images are processed, either 
each color channel (e.g., red ("R"), green ("G"), and blue ("B")) is treated separately or 
the luminance channel is used. 

[0034] For gray scale images, the redistribution performed through histogram 
equalization, for an input / and producing an output Oy, may be described by: 
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Ox = 256 x ( / * ^ J"" 1 (Equation 1) 

[0035] In Equation 1 , N x and N y are an image width and height in pixels, 
respectively, and P L is the population of pixels at a level L. The output value O/ for 
each input value is stored in a look-up table. The look-up table is used to transform the 
10 image. 

[0036] To reduce artifacts, a moving window technique may be added to histogram 
equalization. In moving- window AHE the image is divided into tiles, but the look-up 
table is made using a bigger window. The bigger window is selected in such a way that 
there is an overlap between windows used for neighboring tiles. This overlap ensures 
1 5 that look-up tables used in neighboring tiles aren't drastically different. This produces a 
smoother transition over tiles. However, this method still has some deficiencies. 

[0037] In some embodiments of the invention, a means or mechanism to control the 
degree of enhancement in AHE is provided. The mechanism may be a control 
parameter. In one embodiment, a control parameter, C is used and calculations are 
20 performed using a modified version of Equation 1. In some embodiments, C can take 
values from 0 (minimum) to 1 (maximum). A modified output O2 may be given by: 

25 <9 2 = /+ (O, -/ )*C (Equation 2) 

[0038] As one can see by reference to Equation 2, when C is at its minimum (0), the 
output is the same as the input. When C is at its maximum (1), then Equation 2 reduces 
30 to Equation 1 . Thus, by changing the control parameter C from 0 to 1 the degree or 
amount of enhancement can be varied from no-effect to full-effect. In some 
embodiments, the control parameter C is specified by the user (e.g., is input via a 
graphical user interface), is determined by the tile characteristics, or both. 

[0039] The inventors have determined that when tile characteristics control the 
35 degree of enhancement (e.g., in contrast) of an image, excessive graininess in the 
processed image may occur. In general, grainy images are produced when what is 
called a "smooth" image (i.e., an image with few gray levels) is processed by histogram 
equalization. 
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[0040] As will be discussed in greater detail, in some embodiments of the invention, 
when a smooth image or, more specifically, a smooth tile is detected, the amount of 
equalization or enhancement is reduced. Smoothness of an image or specific tile may be 
determined using a concentration ratio ("CR"). A CR is a number that indicates how 
concentrated or widespread the population distribution of a histogram is. Generally, if 
the population is distributed evenly across all levels, the CR is a large number. 
Likewise, if the entire population is concentrated at a few levels, the CR is generally a 
small number. In a histogram where Pi is the population at level L, the CR may be 
described by: 




(Equation 3) 



[0041] When using Equation 3, the CR for an image or tile is greater than 1 . For 
example, for a tile having a size of 16 x 16 pixels, the CR ranges from 1 to 16. In 
embodiments of the invention, the CR may be scaled to range between 0 and 1 . 
Continuing the prior example for a 16 X 16 tile, the CR can be divided by 16 to make it 
range between 1/16 and 1. 

[0042] The CR (usually after it is scaled) may be used as the control parameter C. 
Alternatively, the CR may be combined with a parameter (referred to herein as "UI") 
provided by a user (for example, through a dialog box presented by a graphical user 
interface on a computer or display panel for a printer or multifunction device) and the 
combination of the CR and the UI may be used as the control parameter C. In addition, 
the CR, UI, or both may be modified before they are combined or used as the control 
parameter C. Modification of the CR, UI, or combination may be performed to balance 
the dominance of CR and UI in overall control. Modification of the parameters can be 
accomplished in a variety of ways: e.g., by dividing or multiplying the parameters by a 
number or by taking the square or cube root, etc. of the parameters. 

[0043] Regardless of how the control parameter C is ultimately determined, 
embodiments of the invention use the control parameter C in Equation 2 to generate a 
look-up table. The look-up table may, in turn, be used to transform the current tile. 

[0044] Fig. 1 illustrates a processing system 10 where image data 12, which may 
include an image, is input into an image processor 14. The image processor performs 
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processing as described herein and generates a processed image or processed image data 
16. The image processor may be implemented with a variety of hardware and software. 
For example, a general-purpose, personal computer 17 (Fig. 2) having a central 
processing unit 18 ("CPU"), solid-state memory (not shown), a disk drive (also not 
5 shown), running operating system software (such as Microsoft Windows, also not 

shown) and input and output devices, such as a keyboard 19, a mouse, and monitor 20, 
may be programmed in accordance with the teachings herein. Image data from, for 
example, a digital camera 21 may be downloaded to the computer 17 and processed. 
The processed image data 16 may be displayed on the monitor 20, printed using a 
10 printer 22, or displayed on a display panel incorporated into a printer or multifunction 
device. Image data may also be provided by a scanner 23. 

[0045] Software implementing embodiments of the invention may be created as an 
application program executed under the control of the operating system. Other 
embodiments could be implemented using application specific integrated circuits, digital 
1 5 signal processing devices, and other hardware. 

[0046] As shown at step 25 in Fig. 3, the processor 14 retrieves an image either from 
an image capture device (such as the camera 21 or a scanner) or from memory (e.g., in 
the CPU or the disk drive) and breaks the image into small tiles at step 27. Techniques 
for breaking images into tiles are known in the art. For example, the technique 
20 described below may be used in embodiments of the invention. 

[0047] An exemplary image that might be retrieved in step 25 could have a width of 
300 pixels and a height of 200 pixels and, therefore, include a total of 60,000 pixels. 
Assuming for purposes of this example that the image is also a 256 gray scale image, 
then each pixel can have a gray value between 0-255. Generally, image data, for 
25 example, data in a JPEG file, is available or can be readily converted to a two 
dimensional array, which in pseudo code could be expressed as an array named 
"imageData" having a "width" and a "height" as follows: imageData [width] [height]. 

[0048] To divide the image into small blocks or tiles, the image may be scanned 
using programming loops along the width and height with increments in the width and 
30 height made in chunks. In an embodiment where tiles or blocks measuring 10 by 10 
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pixels is implemented, and the images being processed have a width of 300 pixels and a 
height of 200 pixels, software could be implemented in accordance with the follow 
code, where the variable x corresponds to a position along the width of the of tile and 
the variable y corresponds to a position along the height of the tile. 

5 

For(x=0; x<300; x+= 10) //Scan the image along width. Increment x by 10 each 
time. 

{ 

For(Y=0; y<200; y+= 10) //Scan the image along height. Increment y by 10 
10 each //time. 

{ 

ProcessCurrentBlockQ; //Call a processing method or procedure for 

//whatever processing is needed for the 
current 

15 //block. 
} 

} 

[0049] Each tile in the image is processed as shown at step 29 (for example in the 
processing method or procedure "ProcessCurrentBlock"). The processing that occurs at 

20 step 29 will be described in greater detail below. At step 31, the processor 14 checks to 
see if all the tiles in the image have been processed. If more tiles need to be processed, 
step 29 is repeated, as shown by loop 30. If all the tiles have been processed, the 
procedure ends, as shown at step 33. The fully processed image or image data 16 may 
be stored in memory or output to the monitor 20, the printer 22, or another device. The 

25 order of the processing may also be reversed or the origin used to begin the processing 
may be moved to another point on the image. For a color image, a similar process 
would be followed for each of the RGB color channels or the luminance channel. 

[0050] Figs. 4 and 5 provide additional details regarding the processing step 29. As 
shown in Fig. 4, in some embodiments, the processing step 29 includes fetching or 

30 otherwise acquiring the UI (for example, by retrieving the same from memory or from 
querying a user to input the same) as shown at step 40. Next, a histogram of the tile is 
generated as shown at step 42. Histograms may be generated using a variety of known 
techniques. An exemplary technique is illustrated in the pseudo code described below, 
where exemplary image data is again assumed to be provided in the form of a two 

35 dimensional array such as "imageData [width] [height]." 
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Int hist[256]; //A single dimension array that contains the population of each gray 

value. //For example, if hist[5] = 200 then there are 200 pixels of gray at 
value 5. 

5 Int level; //An intermediate variable that keeps the gray value of the current pixel. 

For(I=0; I<256; 1++) //For all levels assign an initial population of zero. This is called 

//histogram initialization. Increment through each level as the 
image is //scanned. 

10 } 

hist[I] = 0 

} 

For(x=0; x<300; x++) //Scan the image along its width. 

15 { 

For(y=0; y<200; y++) //Scan the image along its height. 

{ 

level = imageData[x][y]; //Gray value of current pixel. 
hist[level] = hist[level]+l ; //Increment the population. 

20 } 
} 

[0051] After the above loop ends, the histogram is complete. If implemented 
properly, the total number of histogram values should be the same as the total 
population or image size (i.e. for this example, 200*300 = 60000). Again, histograms 
25 for each of the RGB color channels or the luminance channel may be generated when 
processing a color image. 

[0052] After the histogram for the tile is determined, the CR for the tile is 
determined at step 44. In some embodiments, the CR is determined using Equation 3 
and the scaling described above. Other ways of determining a CR are described below. 
30 The UI parameter and CR may be used to generate a final control parameter as shown at 
step 46. 

[0053] As shown in step 48, in some embodiments the processor 14 can also 
determine a histogram of a larger region of image data that includes the tile being 
processed. This can be best understood by reference to Fig. 5. 

35 [0054] Fig. 5 includes a representation of image data 50 that includes a first tile 52 
and a first outer window 54. A histogram equalization look-up table for the first outer 
window 54 is generated, but applied to the first tile 52. The tile and outer window are 
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shifted (as shown in phantom by a second tile 56 and a second outer window 58), and 
the process is repeated. (Through the loop 30 shown in Fig. 3.) 

[0055] On an individual-tile 52 and window 54 basis, a histogram of window 54 is 
determined, as shown at step 48. A first look-up table is then generated using, for 
5 example, Equation 1 , as shown in step 62. Next, Equation 2 may be used to create a 

second look-up table, as shown in step 64. The second look-up table may then be used 
to transform the tile (for example, tile 52), as shown at step 66. The processing of a 
single tile is then complete as shown at step 68. As noted above, the overall process 
continues until all tiles are processed (Fig. 3). 

10 [0056] Figs. 6A-6C illustrate the effects that application on an embodiment of the 

invention may have on an underexposed color image 100 (Fig. 6A). Fig. 6B illustrates a 
modified version of the color image 100, image 102. Image 102 was generated using a 
moving-window, AHE technique. Fig. 6C illustrates another modified version of the 
color image 100, image 104. Image 104 was produced using a controlled, moving- 

1 5 window, AHE implementation of an embodiment of the invention. As can be seen, in 
color image 100 a group of people 1 10 is obstructed by a shadow 1 12 in the image. In 
image 102, the group of people 1 10 is visible, but equalization has increased the 
graininess of the image 102 to a point where the image loses a life-like appearance or 
authentic aesthetics. In some ways, the image 102 looks like the product of animation 

20 rather than an accurate representation of an actual event captured authentically by a 

camera. In the image 104, the obstruction of the group of people 1 10 by the shadow 1 12 
is reduced, but the image does not exhibit excessive graininess and maintains an 
appearance of authenticity. 

[0057] As discussed above, a histogram of a gray scale image generally represents 
25 the population of each gray level in the image. Although a histogram does not, in 
general, provide spatial information, it may still reveal many image features. For 
example, a typical text-only image has only two levels 0 (black) and 255 (white). As a 
consequence, the histogram for such an image will have only two levels populated. On 
the other hand, a typical photograph will have many and usually most of the levels 
30 populated. Fig. 7 illustrates an exemplary gray scale photograph 125 and Fig. 8 

illustrates a histogram 127 for the photograph. As can be seen, the histogram 127 is 
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populated at numerous levels with a higher concentration at a level LI (100 on the scale 
on the x-axis). Fig. 9 illustrates a text only image 130. A histogram 132 for the text 
only image 130 is shown in Fig. 10. This histogram 132 is populated at a two levels, L2 
(0 on the scale of the x axis) and level L3 (255 on the scale on the x-axis). 

5 [0058] It would seem, particularly based on the information in Figs. 8 and 10, that 
distinguishing between photographic and text images would only require counting the 
number of levels that are populated in a histogram. For example, it might be assumed 
that if only two levels are populated, then the image being classified is a text image. 
However, this simple method does not work in many instances. In some cases, a text 
10 image may not have exactly two levels. Some reasons for this include: edges of the text 
may have been stuffed with medium gray levels to reduce aliasing; and there may be 
noise or background data in the image. The inventors have discovered that the CR of an 
image may be used to distinguish noise from real data and to improve the accuracy of 
classification. 

15 [0059] As noted above, the CR of an image may be determined using Equation 3. If 
the population is distributed evenly across all levels, the CR will generally be a large 
number. If the population is concentrated at a few levels, the CR will generally be a 
small number. An example illustrates this. First, assume there are 256 possible levels 
and the total population (or total number of pixels) in the image being analyzed is 5 12. 

20 Second, calculate the CR for two extreme cases. In the first case, suppose the 

population is evenly spread across all levels, i.e., each level has two pixels. In this case, 

CR = (2 + 2 + 2 . . . + 2) 2 /(2 2 + 2 2 + 2 2 + . . . + 2 2 ) 
Ci? = (256x2) 2 /(256x 2 2 ) 
25 CR = 256 

In the second case, suppose the population is concentrated at one level, i.e., one level 
has 512 pixels and all other levels have no pixels. In this case, 

30 CR = (0 + . . . + 0 + 512 + 0 + . . . + 0) 2 /(0 2 + . . . 0 2 + 512 2 + 0 2 . . . + 0 2 ) 

CR = (1 x512) 2 /(l x512 2 ) 
CR= 1 

12 
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For any other population distribution for this example, the CR will generally be a 
number between 1 and 256. For example, if the population is distributed equally among 
half the levels, then the CR will be 128. 

[0060] The concentration ratio is generally unaffected by noise, which can be 
5 illustrated by another example. Suppose the second case above is the ideal case and the 
CR was expected to be one. However, due to some error 128 pixels (out of 512) were 
registered at different levels. So, 128 levels are singly populated and one level has a 
population of 384. In this case, 

10 CR = (0 + 0 + ... + 0+ l+ ...+ l + 384) 2 /(0 2 + 0 2 + . . . + 0 2 + l 2 + . . . + l 2 + 

384 2 ) 

CR = (128 x 1 + 1 x 384) 2 /(128 + 384 2 ) 
CR= 1.78 

1 5 This value is still very close to one, especially considering the possible range for the CR 
(1-256) and the amount of noise that was added. 

[0061] CR is also substantially invariant with respect to the location of the 
concentration. For example, it generally does not matter whether the population is 
concentrated at level 254 or at level 253, the CR will remain same. This characteristic 
20 can be useful in some circumstances, particularly when the number of levels populated 
is more important than their locations. 

[0062] In the examples above, the CR was discussed using numerical examples. 
However, the numbers should not be taken too literally, as they will change under 
different circumstances. For example, if the possible number of levels is not 256 then 

25 the range for the CR will not be 1-256. In addition, if the image is colored then the 
luminance channel is typically used to distinguish between text, graphic, and 
photographic images. (The color or gray scale identification can be made using a 
variety of known techniques including analyzing header information that may be 
available in an image file, such as a JPEG file.) Furthermore, the exponents in Equation 

30 3 are only exemplary ones. They can be replaced by any number that is more than one 
and still provide useful information. For example, the following equation can be used. 

13 



Attorney Docket No: 2003-0485.02 




(Equation 4) 

Regardless of the exact technique used to determine the CR of an image, once the CR is 
found, it may be used to classify an image or portions thereof. For example, the CR 
may be compared with different thresholds for classification as shown below. 

5 

If (CR < T) Image Type = Text (Equation 5) 

If (T < CR < P) Image Type = Graphic (Equation 6) 

If (P < CR) Image Type = Photographic (Equation 7) 

10 [0063] The thresholds in Equations 5, 6, and 7 (T and P) may be determined through 
a training process. For example, the CR of numerous images (e.g., 100's of images with 
known classifications of text, graphic, or photographic) may be generated, for example 
by using an image processing system that implements Equation 3 or 4 in an automated 
fashion (for example, in software). Then the CRs may be analyzed to determine the 

1 5 general range of CR values for the different classes of the analyzed images. With a 
sufficiently representative sample set of images, the results may be applied to other, 
unprocessed images of unknown classification. Refinement of the image thresholds 
may be made by using multiple samples and/or larger sample sets. 

[0064] Fig. 1 1 illustrates an image classification system 150. The system 150 
20 includes a classification processor 152 that receives image data 154 (such as a 

photographic, graphic, or text image). The classification processor performs processing 
(that will be described in greater detail) and produces a classification value or output 
1 56. The classification processor 1 52 may be implemented using the same hardware 
illustrated in Fig. 2 and as described with respect to the image processor 14. 

25 [0065] Fig. 12 illustrates a processing procedure that may be implemented by the 
classification processor 1 52. The processor 1 52 may first retrieve an image (for 
example from memory or from an image capture device such as a scanner or digital 
camera), as shown at step 1 60. Once the image has been obtained, the classification 
processor retrieves one or more classification thresholds (such as the thresholds T and P, 

30 which may, for example, be stored in memory) at step 162. Once the desired thresholds 

are obtained, the classification processor 152 determines the CR (at step 164) for the 

image obtained at step 160. At step 166, the classification processor 166 compares the 
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CR to the one or more thresholds of interest. In general, it is preferable that use of the 
three thresholds discussed be implemented, but one, two, three, or more thresholds 
could be used depending on the level of classification needed. For example, if it is 
desired to know only whether an image is a text image or not, than only a single 
5 threshold is needed. Likewise, if it is desired to classify images into four classes, for 

example, text, graphic, photographic night, and photographic day, three thresholds may 
be determined using the training procedure noted above and applied in the classification 
processor 152. 

[0066] The process of determining the CR for an image being analyzed in the 
10 process outlined in Fig. 12 is shown in additional detail in Fig. 13. As shown at step 
175, the image of interest is processed. First, the luminance or gray scale components 
of the pixels are determined, as shown at step 177. Then, the histogram, or more 
specifically, the histogram bins are initialized to zero, as shown at step 179. Next, the 
histogram is populated (step 181). Steps 177, 179, and 181 maybe implemented using 
1 5 known histogram tools and techniques, including the ones discussed above. Once the 
histogram is determined, the CR for the image may be determined using Equation 3 or 
4, including the possible variants thereof as discussed herein (step 183). Finally, in the 
embodiment shown the CR may be returned to the main process illustrated in Fig 12. 
As should be apparent, the processes shown in Figs. 12 and 13 could be implemented in 
20 a variety of procedural languages using functions and procedures and parameter passing. 
Object-oriented techniques could also be used, implementing the processes in Figs. 12 
and 1 3 as one or more methods. 

[0067] One advantage of image classification performed using embodiments of the 
invention is that no manual intervention is required. In other words, the system may be 

25 automated. In addition, due to its computational simplicity, embodiments can be 

implemented in stand-alone devices that use ASICs. For example, an ASIC could be 
designed from a software prototype using hardware descriptor language. In addition, 
embodiments could also be implemented using programmable logic arrays. (These 
comments also apply to the processor 14.) The relative computational simplicity of 

30 embodiments of the classification processor 152 also makes those embodiments 

relatively fast (relatively few instructions must be executed). Thus, embodiments of the 
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invention could be implemented as a pre-scan process in document scanners so that the 
final-scan can be adjusted to suit the type of image being scanned. 

[0068] Fig. 14 illustrates a system 200 where pre-scanning or pre-imaging 
techniques and other features of embodiments of the invention may be combined. An 
5 object 202, such as a person, document, etc., may be imaged using an image capture 
device such as the camera 21 or scanner 23. The image created by the image capture 
device (such as the image 12 or 154) may then be delivered to the image classifier 152 
for classification. The classification generated by the image classifier 152, such as 
classification 156, may be used as feedback to adjust the image capture device. 

1 0 [0069] The image created by the image capture device whether upon a pre-imaging 
or upon a subsequent imaging, as modified using feedback from the image classifier 
152, may be fed to the image processor 14. The image processor 14 may apply the 
controlled AHE discussed above to generate a processed image, such as the image 16, in 
order to correct exposure problems. 

1 5 [0070] As should be apparent to one of ordinary skill in the art, the systems shown 
in the figures are models of what actual systems might be like. As noted, many of the 
modules and logical structures described are capable of being implemented in software 
executed by a microprocessor or a similar device or of being implemented in hardware 
using a variety of components including, for example, application specific integrated 

20 circuits ("ASICs"). Furthermore, throughout the specification and claims reference is 
made to the word "image." It should be understood that the processing techniques 
disclosed could be applied to portions of an image and "image" incorporates whole 
images, partial images, or parts of an image. Thus, the claims should not be limited to 
whole images or to any specific hardware or software implementation or combination of 

25 software or hardware. 

[0071] Various features and advantages of the invention are set forth in the 
following claims. 
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